Usefulness of text-conditioning and a new database for text-dependent speaker recognition research

نویسندگان

Amitava Das

Gokul Chittaranjan

Gopala Krishna Anumanchipalli

چکیده

Text Dependent (TD) Speaker Recognition systems assume that the password to be uttered by the speaker is known to the system. As the password is known, the system can apply a password-specific model capturing the speaker dynamics well. This enables TD systems to perform better than textindependent systems. We present a variation of the TD systems, called text-conditioning, in which the password is uniquely chosen by each user. This delivers a higher level of discrimination since the linguistic and phonetic differences of the passwords themselves are exploited in separating the speakers. As the database for such a study was not publicly available, we built an extensive database for speaker recognition having such text-conditioning property. The database is tested with various speaker recognition trials. The results indicate that for the design of a practical TD speakerrecognition system, “text-conditioning” does offer a significant edge.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

BioSec Multimodal Biometric Database in Text-Dependent Speaker Recognition

In this paper we briefly describe the BioSec multimodal biometric database and analyze its use in automatic text-dependent speaker recognition research. The paper is structured into four parts: a short introduction to the problem of text-dependent speaker recognition; a brief review of other existing databases, including monomodal text-dependent speaker recognition databases and multimodal biom...

متن کامل

RSR2015: Database for Text-Dependent Speaker Verification using Multiple Pass-Phrases

This paper describes a new speech corpus, the RSR2015 database designed for text-dependent speaker recognition with scenario based on fixed pass-phrases. This database consists of over 71 hours of speech recorded from English speakers covering the diversity of accents spoken in Singapore. Acquisition has been done using a set of six portable devices including smart phones and tablets. The pool ...

متن کامل

The RSR2015: Database for Text-Dependent Speaker Verification using Multiple Pass-Phrases

متن کامل

A modified HME architecture for text-dependent speaker identification

A modified hierarchical mixtures of experts (HME) architecture is presented for text-dependent speaker identification. A new gating network is introduced to the original HME architecture for the use of instantaneous and transitional spectral information in text-dependent speaker identification. The statistical model underlying the proposed architecture is presented and learning is treated as a ...

متن کامل

Text-dependent speaker recognition by efficient capture of speaker dynamics in compressed time-frequency representations of speech

Prevalent speaker recognition methods use only spectralenvelope based features such as MFCC, ignoring the rich speaker identity information contained in the temporalspectral dynamics of the entire speech signal. We propose a new feature called compressed spectral dynamics or CSD for speaker recognition based on a compressed time-frequency representations of spoken passwords which effectively ca...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2008

Usefulness of text-conditioning and a new database for text-dependent speaker recognition research

نویسندگان

چکیده

منابع مشابه

BioSec Multimodal Biometric Database in Text-Dependent Speaker Recognition

RSR2015: Database for Text-Dependent Speaker Verification using Multiple Pass-Phrases

The RSR2015: Database for Text-Dependent Speaker Verification using Multiple Pass-Phrases

A modified HME architecture for text-dependent speaker identification

Text-dependent speaker recognition by efficient capture of speaker dynamics in compressed time-frequency representations of speech

عنوان ژورنال:

اشتراک گذاری